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Abstract 



We study the scalar mass matrix of general supersymmetric theories 
Q\ \ with local gauge symmetries, and derive an absolute upper bound on the 

lightest scalar mass. This bound can be saturated by suitably tuning 
the superpotential, and its positivity therefore represents a necessary and 
sufficient condition for the existence of metastable vacua. It is derived by 
looking at the subspace of all those directions in field space for which an 
arbitrary supersymmetric mass term is not allowed and scalar masses are 
controlled by supersymmetry-breaking splitting effects. This subspace 
includes not only the direction of supersymmetry breaking, but also 
the directions of gauge symmetry breaking and the lightest scalar is in 
general a linear combination of fields spanning all these directions. We 
present explicit results for the simplest case of theories with a single local 
gauge symmetry. For renormalizable gauge theories, the lightest scalar 
is a combination of the Goldstino partners and its square mass is always 
positive. For more general non-linear sigma models, on the other hand, 
the lightest scalar can involve also the Goldstone partner and its square 
mass is not always positive. 



1 Introduction 

It has been known since the early days of supersymmetry that the spontaneous 
breaking of supersymmetry allows to split the masses of bosons and fermions but not 
to achieve totally arbitrary mass matrices. In general, these mass matrices consist 
of a supersymmetric contribution that is common to all the states of a multiplet 
plus a non-supersymmetric contribution splitting the masses of these states within 
each multiplet. There are then two sources of constraints in such mass matrices, 
which lead to two different kinds of restrictions. 

The first source of constraints is that the various non-supersymmetric contribu- 
tions to the masses are correlated among each other. A simple consequence of these 
correlations is expressed by the celebrated sum rule constraining the supertrace of 
the full mass matrix. When computing this quantity, the supersymmetric contribu- 
tions to masses drop out and the non-supersymmetric contributions combine into a 
remarkably simple result. This then constrains to some extent the relative masses 
that can be achieved for bosons and fermions, and has important implications in 
phenomenological model building. More precisely, the supertrace of the mass ma- 
trix vanishes for renormalizable anomaly-free theories [1], whereas it depends on the 
Ricci curvature of the scalar manifold, the derivatives of the gauge kinetic function 
and the trace of the gauge symmetry generators in more general non-linear sigma 
models [2]. Similar results also hold true in supergravity theories. Finally, in theo- 
ries with extended supersymmetry these results become even stronger. For instance, 
in theories with rigid N=2 supersymmetry, the supertrace of the mass matrix always 
vanishes [3]. 

The second source of constraints is that some of the supersymmetric contribu- 
tions to masses are fixed by symmetry arguments, and cannot be freely chosen by 
adjusting the superpotential. Most importantly, the supersymmetric contribution 
to the mass of the Goldstino chiral multiplet must vanish, since the fermion of this 
multiplet is constrained by Goldstone's theorem to have vanishing mass. As a re- 
sult, the two scalar partners of this fermion have masses that are entirely controlled 
by splitting effects. Similarly, the supersymmetric contribution to the mass of the 
vector multiplets is fixed by the values of the gauge symmetry transformations, 
since the vector boson masses arise through the Higgs mechanism. As a result, 
the real scalar partner of each massive gauge boson has a mass that differs from 
the gauge boson mass only by splitting effects, and this can also be viewed as the 
statement that the would-be Goldstone chiral multiplet has a constrained mass in 
the supersymmetric limit. A simple consequence of these restrictions is that there 
exists an upper bound to the mass of the lightest scalar, even if the superpoten- 
tial is freely tuned. The case of theories with only chiral multiplets and no gauge 
symmetries is well understood. What matters in this case is the two-dimensional 
sub-block of the scalar mass matrix restricted to the two Goldstino partners. For 
renormalizable models, the two eigenvalues of this matrix are equal and opposite, 



and the best situation that can occur is that both vanish. This implies the pres- 
ence of two pseudo-moduli fields with vanishing mass, which actually represent flat 
directions of the classical potential with peculiar properties [4, 5]. For more general 
non-renormalizable chiral non-linear sigma-models, one similarly finds that the two 
eigenvalues are split around an average value that is fixed by the Riemann curvature 
of the Kahler manifold, and in the best situation one has two scalars with identical 
masses given by this value [6, 7]. Similar results also hold in supergravity theories, 
and these give a useful guideline towards the ingredients that are needed to achieve 
metastable de Sitter vacua in string models [8, 9, 10]. The case of theories involving 
also vector multiplets and local gauge symmetries is more complicated and less un- 
derstood (see for example [11, 12, 13] for some simple examples). In this case, one 
should in principle look at a higher-dimensional sub-block of the scalar mass matrix 
that includes not only the two Goldstino partners but also the Goldstone partners. 
It has been argued in [14] that the presence of .D-type in addition to F-type su- 
persymmetry breaking tends to improve the situation, at least as far as the masses 
of the two Goldstino partners are concerned. But a full analysis including also the 
Goldstone partners is still missing. Finally, in theories with extended supersymme- 
try, similar but even stronger results hold true. For instance, in theories with rigid 
N=2 supersymmetry some of the Goldstino partners are unavoidably tachyonic or 
at best massless in all the situations where supersymmetry breaking is of F type 
from the N=l viewpoint, namely models involving only hyper multiplets [15] or only 
Abelian vector multiplets [16]. On the other hand, it has been argued in [17] that 
such tachyonic Goldstino partners can be avoided in more general situations where 
supersymmetry breaking is also of D type from the N=l viewpoint, like in particular 
models involving non-Abelian vector multiplets or charged hyper multiplets. But a 
general study of the masses of the potentially equally dangerous Goldstone partners 
is again missing, although some explicit supergravity examples have been studied 
in detail [18, 19, 20]. In this same context, it has also been shown in [21] that 
under certain assumptions there exists an algebraic obstruction against a consistent 
non-linear realization of N=2 supersymmetry, and it would be interesting to assess 
whether this captures the same information as the presence of tachyons. 

The purpose of this paper is to perform a detailed study of the scalar mass matrix 
of generic theories with rigid N=l supersymmetry and local gauge symmetries, 
and to derive an upper bound on the value of its lightest eigenvalue. The main 
improvement that we aim to achieve compared to previous analyses is to obtain the 
strongest possible bound, with the property that it should be possible to saturate it 
by adjusting only the superpotential. To achieve this goal, we will need not only to 
consider the effect of the vector multiplets on the two Goldstino partners, but also 
to include in the analysis the Goldstone partners, and focus our attention on the 
full dangerous sub-block of the scalar mass matrix for which supersymmetric effects 
are constrained. 



The main result that we will derive in this work is that the most dangerous scalar 
field is in general a linear combination of both the Goldstino and the Goldstone 
partners. We will moreover argue that the maximal value that the mass of this 
mode can take provides the universal upper bound on the scalar masses of the 
theory that we are looking for, with the property that it can be saturated by tuning 
the superpotential. In the simplest case where there is a single spontaneously broken 
gauge symmetry we will be able to obtain a quite explicit expression for this upper 
bound. More precisely, denoting by /' and x l the orthonormal vectors defining the 
Goldstino and the Goldstone directions in field space, and with m 2 ^ m 2 s and m 2 ^ 
the matrix elements of the Hermitian block m 2 j of the scalar mass matrix along 
these directions, this bound will be shown to be given by: 



m 2 = max 



{i(m 2 // + 2my-i V / (mJ / -2mL) 2 +8|m^| 2 |. (1.1) 



This strongest bound is always smaller-or-equal than the weaker bound maxjm 2 ,,-} 
that can be derived by looking only at the Goldstino direction, independently of the 
optimization over the choice of the vacuum point and the directions /* and x % that 
defines these bounds. In the particular case of renormalizable theories, the optimal 
choice can be clearly identified and is seen to correspond to a maximization of the 
value of the D auxiliary field of the involved vector multiplet. The bound then takes 
the very explicit form m 2 = \q max / q m i n \M 2 , where M is the mass of the gauge field 
whereas q min and q max denote the smallest and largest charges with common sign. 
In the more general case of non-renormalizable theories, the optimal choice depends 
also on the curvature of the scalar manifold and not just on the structure of the 
gauging, and can no longer be explicitly determined. It is then not possible to make 
the bound more explicit without specializing to a particular model. 

The rest of the paper is organized as follows. In section 2 we review the general 
structure of supersymmetric theories with gauge symmetries. In section 3 we de- 
scribe the form of the scalar mass matrix and study its restriction to the subspace 
defined by the Goldstino and Goldstone directions. In section 4 we derive a general 
upper bound on the lightest scalar mass, focusing on theories with a single sponta- 
neously broken gauge symmetry where the relevant matrix is three-dimensional and 
can be studied analytically. In section 5 we study the special case of renormalizable 
theories and show that in that case the lightest scalar in the optimal situation is 
always a combination of just the Goldstino partners, with a positive square mass 
depending on the charges. In section 6 we discuss the qualitative features of the 
more general case of non-renormalizable theories and argue that in that case the 
lightest scalar in the optimal situation is a combination of the partners of not only 
the Goldstino but also the Goldstone modes, with a square mass of indefinite sign 
that depends both on the curvature and the structure of the gauging. In section 7 
we present our general conclusions. Finally, in appendix A we study in some detail 
a few concrete examples of models to illustrate our general results. 



2 General supersymmetric theories 

Let us consider a generic N — 1 theory with n chiral multiplets $* and /c vector 
multiplets V a . The most general two-derivative Lagrangian for such a theory is 
specified by a real Kahler potential K, a holomorphic superpotential W, a holomor- 
phic gauge kinetic function H a b and some holomorphic Killing vectors X l a generating 
a group of isometries: 

£= [d 4 6K($,$,V)+ fd 2 9 W($) + ^H ab ($) W aa W b a + h.c. . (2.1) 



In view of taking the Wess-Zumino gauge, we can study this theory by expanding 
in powers of V a . From now on, we will then denote by K the Kahler potential at 
vanishing V a . This defines a metric g^ = Kij, a Christoffel symbol Tf, = g kl K^i 
and a Riemann tensor R^ k j = K^ k i — g p ^K ik gKji p for the scalar field geometry. 
The usual coordinate-covariant derivatives for this geometry will be denoted by V*. 
For later convenience, we shall furthermore introduce an arbitrary gauge coupling 
constant g, although this could be reabsorbed in the normalization of H a b. The 
gauge transformations then act as <5$ J = A a X t a on the chiral multiplets and as 
§V a = -|# _1 (A a - A a ) + \f hc a (^ b + A- b )V c + 0(V 2 ) on the vector multiplets. The 
former correspond to general non-linear transformations of the scalar fields involv- 
ing the functions X l a and linear transformations of the fermion fields involving the 
scalar-dependent matrices Q a l j = "^VjX*, whereas the latter correspond to the usual 
transformations of the gauge fields and gaugini involving the structure constants 
fab°- We exclude for simplicity the possibility of non-zero variations that amount 
to a non-trivial Kahler transformation, since such situations are not guaranteed to 
be compatible with a coupling to gravity and can also not emerge in low-energy 
effective descriptions of microscopic theories where the variations were strictly van- 
ishing (see [22] for a recent discussion of this point). In particular, we thus exclude 
Fayet-Iliopoulos terms. The gauge invariance of the Lagrangian then implies the 
following conditions: 

X\K % = - 2 g- x K a , (2.2) 

XiWi = 0, (2.3) 

X l a H hci = -2f a(b H c)d . (2.4) 

In addition to these conditions, one also has to impose the equivariance condition 
on the Killing vectors: 



i 



g^XlXl^-g-'fJKc. (2.5) 

The derivative of (2.2) implies that K ai = 2igX ai , which shows that — ^g^K a can be 
identified with the Killing potentials for the Killing vectors X l a . Moreover, eq. (2.5) 
guarantees that these can be chosen to transform in the adjoint representation. 



In the following, we shall for simplicity restrict to the special case where the 
gauge kinetic function H ab is constant, so that H abi = 0. This does not represent 
a very big conceptual limitation, but it leads to a substantial simplification of the 
theory. The condition (2.4) then states that the structure constants with the upper 
index lowered with the gauge kinetic function should be totally antisymmetric. This 
implies that H ab should be equal to some constant real matrix h ab proportional to 
the Killing metric of the gauge group, which in most of the cases is just the identity 
matrix. We shall then assume that 

H ab = h ab , H abt = 0. (2.6) 

We shall on the other hand retain the possibility of having a generic Kahler potential 
K and generic Killing vectors X l a defining non-constant Q a l j- The particular case of 
renormalizable gauge theories corresponds to choosing K = 5ijQ l <&\ X l a = —iT a l j& 
and Qaj = Taj, with constant T ai j. 

In the Wess-Zumino gauge, the Lagrangian for the physical component fields <p l , 
ip l , A a and A a is given by the following expression: 

C = -g i3 D li <j> i D^- ig^{p^ + Tl n Iprr) -\h ab F; u F b ^ 

- l -h ab X a p\ b + h.c. + gftWiWj + I h ab K a K b + \v t W 3 ft ft + h.c. 

+ V2gg tJ Xl^\ a + h.c. - ±R i0 ftftftft. (2-7) 

In the above expression, F° = d^A^ — d u A a ^ + gf bc a A b ^A c v is the gauge field-strength 
and D^ = 8^ + gA^Xl D^ft = d^ft - igA^ Qjj ft, D^\ a = d^\ a + gf^A^X" 
are the gauge-covariant derivatives. 

The vacuum is defined by constant values of the scalars <p l and vanishing values 
of the fermions ft, X a and the vectors A a , minimizing the energy. The values of the 
auxiliary fields F l and D a are then fixed by their equations of motion and read: 

F i = _ g iJWj, D a = --h ab K b = igh ab XlK i = -igh ab XlK- l . (2.8) 

The vacuum energy V is given by 

V = g is F i F?+±h ab D a D b . (2.9) 

The stationarity condition Vi = implies that 

V i W j F* + igX ai D a = 0. (2.10) 

Finally, one may easily compute the masses for the modes describing small fluctua- 
tions around such a vacuum. The scalar square masses are given by 

m% = g^V.WkVjWf - R l0 F k F ! + g 2 h ab X ai X bJ + g Q mJ D a , (2.11) 

m% = -V^jWk F K - g 2 h ab X ai X bj . (2.12) 



The fermion masses are instead found to be 

fJiij = ViWj , /J,ab = 0, Via = V2X ai . (2.13) 

Finally, the vector boson square masses read 

M 2 ab = 2g 2 g tJ Xi a Xl. (2.14) 

The vacuum is at least metastable if the full mass matrix for the scalar fluctuations 
turns out to be a positive definite matrix. 

The global supersymmetry is spontaneously broken whenever V 7^ 0, that is 
whenever some of the auxiliary fields F l or D a take non-vanishing values. In that 
case there exists a physical Goldstino fermion r] oc F^ 1 + -4= D a X a with vanishing 
mass fj, v = 0. Notice however that by contracting the stationarity condition (2.10) 
with the Killing vectors X l a) taking the imaginary part and finally using (2.3) as 
well as its derivative, one finds the following relation between the values of F l and 
D a at stationary points: 

Q mJ F l Fi - \g- l M 2 ab D b = . (2.15) 

Similarly, by contracting (2.10) with the auxiliary fields F l , one deduces that: 

/j ij F i F j = 0. (2.16) 

These expressions show that the basic source of supersymmetry breaking must come 
from the chiral auxiliary fields F 1 , whereas the vector auxiliary fields D a can only 
give additional effects whose sizes are linked to the masses of the vector bosons. 

The local gauge symmetries are spontaneously broken whenever M% b 7^ 0. In 
that case there exist unphysical would-be Goldstone scalars a a oc X ai ^ % + X aT ^ 1 
with formally vanishing masses m ffa = 0. But these modes are in fact absorbed 
by the gauge bosons through the Higgs mechanism, and therefore map to physical 
degrees of freedom that are massive. In the same process, the combinations of chiral 
fermions \a oc X a iip l pair with the gaugini A a to give massive Dirac fermions. 

The mass spectrum displays a rather intricate structure in the general situation 
in which both supersymmetry and the gauge symmetries are broken. As discussed 
above, the relevant complex directions defining these two breakings are respectively 
F l and X l a , and gauge invariance of the superpotential implies that these are orthog- 
onal to each other: gijF l Xl = 0. When supersymmetry is unbroken, the situation 
simplifies and can be understood in terms of multiplets. The k chiral multiplets cor- 
responding to the directions X l a are generically absorbed by the k vector multiplets 
through a supersymmetric Higgs mechanism. In a super-unitary gauge, one is then 
left with n — k chiral multiplets corresponding to the directions orthogonal to X l a 
plus k unconstrained vector multiplets. The physical square-mass spectrum then 



consists of n — k levels corresponding to the eigenvalues of the matrix g M WikWj[ re- 
stricted to the subspace orthogonal to the X l a) each containing two real scalars and 
one two-component fermion, and k levels corresponding to the eigenvalues of the ma- 
trix 2g 2 g i jX t (a Xl, each containing one real scalar, two two-component fermions and 
one three-component vector. When supersymmetry is broken, on the other hand, 
additional mass splittings are generated with respect to the above spectrum, and the 
situation becomes more complicated. But the essential modification with respect to 
the previous case is rather simple. In each chiral multiplet the two real scalars can 
split from the fermion and in each unconstrained vector multiplet the real scalar 
can split from the fermions and the vector. In addition, one linear combination of 
all the fermions must be exactly massless. 

A well-known general result about the above mass matrices that holds true 
at any point, even if supersymmetry and the gauge symmetries are both broken, 
is the supertrace of the full square-mass matrix. This is found to be given by 
str.A/f 2 = 2R i jF l F^ + 2gQ a t i D a . At a generic stationary point, one may further 
simplify this result by using the relation (2.15). In this way, one finally finds: 



strM' 



R tJ + 2 g 2 Q a \ M~ Iab Q U j FF> . (2.17) 



The value taken by the right-hand side restricts to some extent the relative values 
that bosons and fermion masses can take. 



3 Structure of the scalar mass matrix 

Let us now study more specifically the masses of scalar fields. Since the two real 
components of each complex scalar field are allowed to split, one has to consider the 
space of all the independent real modes. This can be described by 2n-dimensional 
vectors $ 7 built out of the n fields <p l and their complex conjugates 4> l : 

<// = (0* ?), 4> J =(yj • (3.i) 

With this parametrization, the quadratic Lagrangian for the scalar fields can be 
written in the following form: 

C = \gijd^d^ J - \m\jtfV , (3.2) 

with wave-function and square-mass matrices given by 

To obtain the physical masses, one can then proceed as follows. First, one choses a 
parametrization of the fields such that the wave-function g T j locally trivializes to the 



identity matrix and the kinetic terms are canonically normalized. This corresponds 
to choosing normal coordinates around the vacuum point. Next, one diagonalizes 
the Hermitian matrix rn^j to find the mass eigenvalues m%y Equivalently, one can 
consider the matrix m^j in a new basis defined by a set of vectors v z K that are 
orthonormal with respect to the metric g T j. The eigenvalues of the new matrix 
defined by all the matrix elements of m^j on the basis of vectors v T K then yield 
directly the physical masses. This is the approach that we will use. 

To make progress in our quest for an interesting bound on the physical mass 



,2 
'in 

standard results in linear algebra. The basic point is that the value of the matrix 
i 2 jj along any particular direction must be larger that m^ in . 



eigenvalues, and in particular the minimal physical eigenvalue w^ in , we will use some 

t the 
m T j along any particular direction must be larger that m^ in . A slight generalization 



of this is that the eigenvalues of any sub-block of the matrix mjj, corresponding 
for example to the subspace spanned by a set of several particular directions, must 
similarly be all larger than ra^. This means that we can find an upper bound to 
m mm by computing the smallest eigenvalue of any principal sub-matrix of rn^j. In 
general, the obtained bound improves in quality by considering larger and larger 
sub-matrices, and the exact value of rn^^ can be obtained only by considering the 
full matrix. Nevertheless, there is a well-defined limiting situation in which the 
bound derived by considering a finite diagonal block actually saturates m^ in . This 
happens when the complementary diagonal block has eigenvalues that are very large 
compared to the elements of the off-diagonal block. For this reason, to detect the 
obstructions against making m^ in large it is enough to study the mass matrix along 
those directions where its values cannot be made arbitrarily large by adjusting the 
superpotential. 

Each direction defined by a unit vector v l in the space of complex scalar fields 
(j) 1 defines a plane in the space of real scalar fields 7 , which can be described by a 
basis of two orthonormal unit vectors v{_ and v 1 defined as follows: 



v 



I 1 („.i ^\ „J f 



-j=(v l #), v I _ = -^=(tv l -tv'). (3.4) 



Strictly speaking, the vector space of all real scalar fields is a real vector space, 
and one is therefore allowed to perform only real orthogonal transformations. How- 
ever, for the problem of studying the eigenvalues of the mass matrix rri^j, which is 
Hermitian, one may also consider complex unitary transformations, because such 
more general transformations still preserve these eigenvalues. For a given complex 
direction v l , one may then also use as alternative basis the two orthonormal vectors 
Va,b = 75 (v+ T ivi.), which take the form: 

Va = («* 0) , v I B = (0 rf) . (3.5) 

From the discussion of previous section, we know that there are two kinds of 
special complex directions along which the mass matrix displays particular restric- 
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tions. These are the supersymmetry-breaking Goldstino direction F l and the gauge- 
symmetry-breaking Goldstone directions X l a . In all the other orthogonal directions, 
one can have arbitrary supersymmetric contributions to the mass. Taking these to 
be large one can then forget about these extra directions altogether, as already ex- 
plained. Let us then focus on the subspace defined by the complex directions F l and 
X l a . We already know that F % is always orthogonal to all the X l a , as a consequence of 
the gauge invariance of the superpotential. On the other hand, the X l a are in general 
not orthogonal to each other, and the matrix of their scalar products defines in fact 
the vector mass matrix. We may however perform an orthogonal transformation in 
the space of vector multiplets, to go to a basis where at the vacuum all the X l a are 
orthogonal to each other and the vector mass matrix is diagonal. The norms of the 
vectors F l and X % a define respectively the supersymmetry breaking scale y\F\ in 
the chiral multiplet sector and the masses M a of the vector fields. More precisely, 
these quantities are defined as follows: 

\F\ = y/g i jF i Fi, M a = y/2g y /g i jX i a Xl. (3.6) 

One then finds: 

g tJ F*Fl=\F\ 2 , 9l jXlXl= l -g- 2 M a M h 5 ah , g tJ F*Xl=0. (3.7) 

We may finally define the following normalized vectors: 

rpi rpi -y-i vi 

f = ; = — , xt= ; a -. = y/2g-±. (3.8) 

These form an orthonormal basis for the subspace of complex directions we want to 
study, and satisfy: 

g iJ f i f = l, g i - 3 x i a x\ = b ah , g i - 3 fx\ = ^. (3.9) 

Following our general discussion on the map between a complex direction in the 
space of complex scalars and a basis of two independent directions in the space of 
real scalars, we now introduce the following orthonormal basis of real directions: 

/+ = -j={f f) , fl = j={if -if) , (3-10) 

l - 1 ■ - 

X a+ = ~E\ X a %a) 5 X a- = ~7kV X o. ~ ' lX 'a) • (3-11) 

Alternatively, we may as already explained also use the alternative but less physical 
basis defined by 

/I = (f 0) , f B = (0 f ) , (3.12) 

<a = « 0) , x{ B = (0 x\) . (3.13) 



The directions /+ and /£ describe the two real scalar partners of the massless 
Goldstino fermion. Due to the symmetric roles of these two modes, it will in fact 
be convenient to use the alternative description in terms of f A and f B . In the limit 
of unbroken supersymmetry, the modes defined by /£ and /£ would both belong to 
the same multiplet as the massless Goldstino fermion and would thus be massless 
too. As a result, their masses can be non-zero only because of splitting effects. The 
directions x T a+ and x T a _ describe instead two different kinds of real scalars which are 
respectively the unphysical would-be Goldstone modes, which correspond to fake 
null vectors of the mass matrix that we should discard, and their partners, which 
we should instead consider. Due to the asymmetric roles of these two kinds of modes, 
it will not be convenient to use the alternative description in terms of x l aA and x J aB . 
In the limit of unbroken supersymmetry, the modes x r a _ would belong to the same 
multiplet as the massive vector bosons and would thus be massive too. As a result, 
their mass can differ from that of the gauge fields only by splitting effects. We 
thus find a total of 2 + k scalar modes which are dangerous for metastability: the 2 
modes associated to f± and alternatively described by f A B , whose masses are equal 
to zero plus supersymmetry breaking effects, and the k modes associated to x J a _, 
whose masses are equal to the gauge boson masses plus supersymmetry breaking 
effects. 

Let us then look at the mass matrix m 2 j in the (2 + A;)-dimensional subspace 
spanned by the vectors f A = (f l 0), f B = (0 f l ) and x l a _ = (ix^ -ix l a ), which 
form an ortho normal set. More precisely, we need to compute the matrix elements 



m a« = m ^j v av{, where v T a can be either f A , f B or x T a _. Exploiting gauge invariance, 
we can rewrite most of the contributions coming from the non-Hermitian blocks m?- 
and m~ in terms of the Hermitian blocks m%. Indeed, Goldstone's theorem implies 
that rn?jX J a = —m\^x? a at a stationary point. One then finds that the (2 + Tri- 
dimensional sub-matrix m 2 5 takes the form 

ap 
/ ^2 A /o»'»r,2* 



m 2 s 

ap 



A -V2im%\ 



b ff " V ^ info 

2 ff ^ m % b 



A* m 2 „- V2: 



(3.14) 



\V2im% n ~V2im% n 2m 2 J 



J -^a J -^a -^a^b 

where 

2 2 rift 2 2 fi-j 2 2 i -J /oicr\ 

m ff = m ijf f ' m fSb = m ijf X b ' m *ax b = m ijX a X J b , (3.15) 

and 

A = m 2 J fp. (3.16) 

It is important to emphasize that the above structure is completely general, since it 
depends only on the gauge invariance of the theory and not on the detailed structure 
of the masses. 
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It is a straightforward exercise to compute the entries m 2 



m 



1= and ml 



which are given by the Hermitian block m 2 j of eq. (2.11) along the directions defined 
by F l and X l a . The resulting expressions can be significantly simplified by making 
use of the stationarity condition, which holds at the vacuum, as well as the relations 
implied by gauge invariance, which hold at any point and can therefore also be 
differentiated. Most importantly, the dependence on the second derivatives of the 
superpotential can be completely eliminated. Defining the obvious notation R V w V z = 
R i j k iV t w^y k z l and Q a vw = QijV l w^ for any complex directions v\ w\ y l and z\ and 
recalling that M 2 b = M a M b <5 afe , one finds: 



m 



f,f 

2 



R 



ffff 



1 ,,2 
™~XaS b = 2 Mab ~ 



ni 



f%b 



R 



VE. 



Qcff Q 



ff *«cff 

M 2 



D r, 2 ST^ QcffQcx a x b r,2\^a-Qb)ff 

Rffx a x b - *9 l^c —o 2 9 



■fffSb 



VE. 



QcffQ 



M 2 



M 2 



\F?. 



M a M b 



(3.17) 

| 2 , (3.18) 

(3.19) 



The entry A has instead a more complicated expression, and it is not possible to 
simplify it in any relevant way by using the stationarity and the gauge invariance 
conditions. Most importantly, the dependence on the third derivatives of the super- 
potential cannot be eliminated, and varying such derivatives allows to vary A over 
the entire complex plane. Therefore: 



A = generic complex number that can be adjusted by tuning W, 



ijk 



(3.20) 



We may now ask what is the upper bound on the smallest eigenvalue of the 
above matrix m 2 3 when m 2 ?, m% and m? = are held fixed and A is freely varied. 

ap j j ' J x b x a x b j 

As already explained, this would also represent an upper bound on the smallest 
eigenvalue m 2 ^ of the full mass matrix rn 2 j. Unfortunately, this question is still 
quite complicated for generic theories with arbitrary gauge symmetries, where k 
can be arbitrarily large and it is thus difficult to study the full (2 + /c)-dimensional 
matrix. The importance of the Goldstone directions x a with respect to the Gold- 
stino direction depends however crucially on the relative size of the vector masses 
M a compared to the chiral supersymmetry breaking scale a/|-F|. When the M a are 
much larger than y|i^|, the situation simplifies substantially and the heavy vector 
multiplets can in fact be integrated out in a supersymmetric way to define an ef- 
fective theory for the light chiral multiplets. The way in which this can be done 
has been described in some detail in [23] . In particular, for the sub-sector of scalar 
fields we are focusing on, we see that all the modes associated to x^_ are very heavy, 
and their mixings with the modes associated to f T A and f^ have a negligible effect. 
The only dangerous light modes are then those associated with f T A and f T B , and the 
largest value for the smallest mass is obtained by tuning A to zero. The upper 
is then given by (3.17), up to negligible effects of order 0(\F\ 4 /M ( 



bound m^n 



al) 
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and the square bracket in (3.17) can be interpreted as the corrected Riemann curva- 
ture of the effective theory along the Goldstino direction. When the M a are instead 
comparable-or-smaller than y|i^|, the modes associated to x J a _ are a priori as light 
and as dangerous as the modes associated to ]\ and f^, and the problem acquires 
its full-fledged complication. It is this situation that we would like to study in some 
detail. 

For the sake of clarity, we shall mostly restrict our study to the simplest case of 
theories with a single U(l) gauge symmetry and k — 1. In this case, it is possible to 
extract analytically the full information and derive a simple necessary and sufficient 
bound, which can be saturated by adjusting the superpotential. In more complicated 
theories with several gauge symmetries forming a more general group G, on the other 
hand, one may get some partial analytic information by studying smaller sub-blocks 
of dimension one, two and three, and derive simple necessary but not sufficient 
bounds, which can a priori not be saturated by adjusting the superpotential. In 
particular, one may look separately at all the possible directions in the generator 
space and figure out which one leads to the strongest bound. An obvious naive 
guess for a special direction to look at in the space of generators is the direction 
d a = D a /\D\ defined by the vector auxiliary fields D a . This is also suggested by the 
fact that D a appears together with F % in the definition of the Goldstino fermion. 
When looking at the special direction x{_ = <i a a^_, some partial and interesting 
simplifications do indeed occur in the expressions (3.18) and (3.19), but since we 
were not able to reach a really simple and useful result by pursuing this direction, 
we will not comment any further on this, and restrict from now to the basic case 
involving only one symmetry generator. 

4 Bound on the lightest scalar mass 

Let us now consider the case of theories with a single U(l) gauge symmetry, where 
the index a takes a single value and can therefore be dropped. The matrix (3.14) 
is then 3-dimensional, and it turns out that it is possible to study the behavior of 
its eigenvalues in a fully analytic way. In order to illustrate the fact that the study 
of larger sub-blocks of the mass matrix leads to sharper bounds on the lightest 
eigenvalue, we shall however successively study sub-blocks of dimensions one, two 
and three. 

There are three possible principal blocks of dimension one, which correspond to 
the diagonal elements, but only two of them are independent, namely: 

m 2 ff , 2m%. (4.1) 

Both of these values represent upper bounds on m^. Which one is the smallest and 
thus leads to the strongest bound depends however on the situation. We therefore 
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conclude that a first bound that we can write is: 

m L < m fi) , m fi) = mm{m 2 ff , 2 m 2 xx } . (4.2) 



There are then three possible principal blocks of dimension two, but again only 
two of these are independent. The first possibility is the upper 2-dimensional block 
of (3.14), with two identical diagonal elements given by m 2 j and off-diagonal element 
given by A. The two eigenvalues of such a matrix are m 2 r± | A|. The maximal value 
for the smallest of these is achieved by choosing A = and is given by m 2 ?. This 
sets an upper bound on m^ in , but this bound is already contained in the previously 
derived bound (4.2). The second possibility is the lower 2-dimensional block of 
(3.14), which is given by 



m 2 - V2im% \ 
ff ix . (4.3) 

-V2im% 2 m 2 - J 

The eigenvalues of this matrix are easily computed and are given by: 



m\ = \ (m 2 f - + 2 m 2 xx ) ± \ \J {m 2 fJ - 2m 2 xx ) 2 + 8 \m%f . (4.4) 

Both of these eigenvalues set upper bounds on m 2 ^ . The smallest one that leads to 
the strongest bound is always the one with the negative sign choice. This leads to 
a new bound, which is always stronger-or-equal than the previous bound (4.2) and 
takes into account the non-trivial level-repulsion effect induced by the off-diagonal 
element m 2 x : 



< m\ 2) , m\ 2) = i {m) f + 2 m 2 xx ) - \\] {m) f ~ 2m 2 s ) 2 + 8 \m 2 -f . (4.5) 



' ' ^min — ■ ■ - i z I !zi r) 



Finally, one may try to look at the full block of dimension three, which should 
in this case yield the full information. This is given by: 



/ m 2 f A -y/2i m% \ 

2 . /K-^2 



A* ml j y/2: 



im, 



ff y^ L " L fx 
\V2im 2 fx -V2imf x 2m 2 xx ) 



(4.6) 



For generic A, the eigenvalues of this matrix are quite complicated, since they 
are determined by the roots of a cubic characteristic polynomial. However, their 
values for the optimal choice of A that maximizes the smallest of them can be 
determined analytically. To understand this, let us first recall that by the anti- 
crossing theorem of Wigner and von Neumann, one generically needs to tune two or 
three real parameters to force the eigenvalue of a real-symmetric or Hermitian matrix 
to cross. In our case, the matrix is Hermitian but due to its very special form it 
actually behaves like a real-symmetric one. In fact we know that there actually exists 
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a basis where the matrix simplifies from Hermitian to real-symmetric. One can then 
verify that its eigenvalues always cross at isolated points in the A complex plane. 
Knowing this, it becomes clear that the highest value for the minimal eigenvalue is 
obtained at such a crossing point. But since at that point two eigenvalues become 
degenerate, the cubic characteristic polynomial simplifies and it should be possible 
to solve the problem analytically. One way to derive the desired result is to start 
from the characteristic equation written after decomposing the two complex entries 
A and m 2 r s in the form of a modulus times a phase: 

( A _ m J ; ) 2 ( A _ 2m y - 4 \m%\\\ - m)j) 

-|A| 2 (A-2m 2 s ) +4|A||m 2 s | 2 cos(argA-2argmJ s ) =0. (4.7) 

Form the form of this equation, it is clear that the optimal choice for the phase of 
A is the one minimizing the last term, in such a way that the cosine is equal to —1, 
that is: 

arg A = 2 arg m j s + 7r . (4.8) 

Plugging back this expression into the characteristic equation (4.7), this simplifies 
to (\-m 2 f -+\A\)[(\-2m 2 xS )(\-m 2 ff -\A\) -4|mJ s | 2 ] =0. The three solutions 
of this cubic equation for A are now easy to find analytically and they are given by 
m 2 ff -\A\ andi(mJ / + 2m2_ + |A|)±I[(mJ / -2m2_ + |A|) 2 +16|m2_| 2 ] 1/2 . The 
optimal value for |A|, which maximizes the minimal eigenvalue, is obtained when 
the first eigenvalue crosses the smallest of the other two, which is the one with the 
relative minus sign. This fixes: 

|A| = \(m) f - 2 mL) + \^J {m 2 ff - 2m 2 xS ) 2 + 8\m%f . (4.9) 

At the optimal point defined by (4.8) and (4.9), the values of the two degenerate 
lowest eigenvalues and the highest eigenvalues are finally given by: 



™l = \(m) f + 2m 2 xi ) ± ^ (m 2 f f - 2 m 2 ^) 2 + 8 \m%f . (4.10) 

Both of these eigenvalues give upper bounds on ra 2 ^. The smallest one that leads 
to the strongest bound is as before the one with the negative sign choice. This 
leads to a new bound, which is however seen to be identical to the previous bound 
(4.5), showing that the potential level-repulsion effect that is induced by a generic 
off-diagonal element A can be trivialized by optimally choosing the value of this 
element through a tuning of the superpotential: 



2^2 2 

m min ^ m (3) > m (3) 



i(m 2 / + 2my-i V / (m 2 // -2m 2 s ) 2 +8|m 2 s | 2 .(4.11) 



Summarizing, we have managed to find explicit expressions for the upper bounds 

2 2 

(1)' m (2)> m (3) 
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m 2 ,-., m 2 2) , m 2 n\ on the lightest mass that descend from blocks of dimension 1, 2, 3 



As expected, these are increasingly strong and satisfy: 

"»(!)> m( 2) > 771(3) . (4.12) 

These bounds hold however for a fixed theory at a fixed vacuum. In particular, they 
depend on the direction /' and on the vacuum coordinates 0*, which determine the 
direction x % and the values of R^T and Qij. We may then derive a more useful 
and universal bound by further optimizing the superpotential W to maximize the 
smallest mass. The strongest version of this fully optimized bound, which is our 
main result, then takes the form 

m 2 min <m 2 , (4.13) 

where 



m 2 = max 



| \ (m 2 ff - + 2 m y - \ ^(m 2 ff --2m 2 xS ) 2 +8\m%\ 2 } . (4. 14) 



More precisely, the optimization of W defining (4.14) can be performed as follows. 
At any given point one can adjust n — 1 independent complex first derivatives Wi, 
n(n — l)/2 independent complex second derivatives Wij, and (n — l)n(n + l)/6 
independent complex third derivatives Wijk, compatibly with gauge invariance. One 
may then tune the n — 1 Wi to adjust the direction /' and the scale y|i^|, n — \ 
of the W^ to adjust the values of n — 1 of the fields 0* compatibly with the n — 1 
stationary conditions in the non-Goldstone directions, and finally 1 of the W^k to 
adjust the quantity A to its optimal value. In this optimized situation, however, 
there is still 1 combination of fields <p l related to the vector mass M 2 = 2g 2 \X\ 2 
that cannot be freely adjusted, because the stationarity condition (2.15) along the 
Goldstone direction does not depend on W^ and W^k- As a result, (2.15) represents 
a relation between the scales y\F\ and M, for given gauge coupling g. One may 
however still imagine to tune the real gauge coupling g to achieve any desired value 
of \/\F\ and M compatibly with this real stationarity condition. Notice finally that 
after the above optimization procedure we are left with (n — l)(n—2)/2 free complex 
W^ and (n — l)n(n + l)/6 — 1 free complex W^k- This is more than enough to be 
able to decouple all the n — 2 complex scalar fields that occur in addition to the 
Goldstino and the Goldstone partners. The simplest possibility is to take the left- 
over W^ to be large and the left-over W^k to be moderate, so that all these extra 
scalars become very massive and do not induce any sizable negative level-repulsion 
effect on the masses of the Goldstino and Goldstone partners. This shows that the 
bound (4.14) can indeed always be saturated by a last tuning of the superpotential. 

5 Renormalizable gauge theories 

Let us illustrate the implications of our result in the simplest case of renormaliz- 
able gauge theories with a single U(l) gauge group, where the Kahler potential is 
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quadratic and the Killing vector is linear: 

K = 5 i3 &¥ , X { = -i qi ¥ . (5.1) 

In this situation, Qij = qiSij. Moreover, one finds Ki = Sijft and K l = <p l . It then 
follows that X % = —iQ l jKK Thanks to this last property, and calling Q~ ll j the 
inverse of Q l j restricted to the subspace of non- vanishing charges, one may write: 

D = gQ^X i X\ (5.2) 

M 2 = 2g% J X i X 3 . (5.3) 

In this simple situation, the scale of the D auxiliary field is related in a very 
simple and direct way to the mass scale M. Indeed, it follows from the above defi- 
nitions that D = \g~ l Q~^M 2 . Moreover, the condition (2.15) holding at stationary 
points reads in this case Qfj\F\ 2 = ^g~ 1 M 2 D. Using the above relation for D, and 
assuming that Qfj ^ 0, this further implies that \F\ 2 = \g~ 2 Q^(Q / /) _1 M 4 . From 
these relations, we see that stationary points are possible only if 

Q&Qff > o • (5-4) 

Moreover, the values of the overall \F\ and of \D\ are related to M and their ratio 
is fixed in terms of the values of Q % j along the directions f l and x l : 

\D\ = ±g- 1 \Q:^\M 2 , (5.5) 

\F\ = \g- 1 ^Q-MQff)- 1 M 2 . (5.6) 

D 



F 



QxxQff- (5-7) 



When instead Qfj = 0, eq. (2.15) implies that \D\ = 0, whereas \F\ and M can be 
arbitrary. This is the only situation where M can be adjusted independently of \F\. 
Notice that we may write down the following simple bound on the relative im- 
portance of ZJ-type and F-type supersymmetry breaking, in terms of the pair of 
charges g min and g max which possess the largest possible ratio with the constraint 
that they have the same sign [11]: 



< 



Qn 



Qn 



(5. 



This bound can be saturated by choosing the directions /' and x % to be the eigen- 
vectors of Q l j corresponding to the eigenvalues g max and g m i n . 

The scalar masses (3.17), (3.18) and (3.19) undergo two relevant simplifications. 
The first is that all the curvature terms drop, since in this case the scalar manifold 
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2 _ 
xx 



is flat. The second is that due to the relation (5.6) the supersymmetric term in m 
is forced to be of the same order of magnitude as the non-supersymmetric terms. 
One then finds the following simple expressions: 



m 



m„ 



m 



2 

ff~ 


Qxx Qff 


M 2 , 


2 -I 

XX r\ 


i + Qxx Qxx + QxxiQff) Qfj 


2 

fx ~ 


\ 


'xx Qfx 


M 2 . 



M 2 



(5.9) 
(5.10) 
(5.11) 



We observe now that by the restriction (5.4) and some simple linear algebra, we 
can get some useful constraints on the various pieces of these masses. In particular, 
we have that Q~ x \ Q f j > and Q x \{Qff)~ 1 Q 2 j > Q x \ Qff > 0, since Q 2 j > (Qff) 2 - 
Moreover, Q~ XX Q XX has indefinite sign but becomes equal to 1 whenever x % is an 
eigenvector of Q l j, and Q xx Q f x has indefinite sign but becomes equal to whenever 
either /* or x l is an eigenvector of Q l j. 



In this class of models, the masses m 2 f F, m xx 



and m 2 x depend on the vacuum point 



only through the orientation of the direction x l and the size of M. Moreover, by 
varying the vacuum point at fixed M one may achieve all the possible orientations 
for x l , thanks to the simple linear form of X 1 and quadratic form of K. The 
optimization of the superpotential defining the bound (4.14) then amounts in this 
case to optimizing the orientation of the directions /' and x l , with the only constraint 
that they should be orthogonal. There is then a natural guess for the optimal choice 
of /* and x 1 . This consists in choosing these two orthogonal directions to be the 
eigenvectors of Q l j with largest and smallest eigenvalues with common sign, namely 
g max and g min . With such a choice, m 2 =• is maximal, m 2 x vanishes and 2 m 2 xx is larger 
than m 2 f . The precise values are 



7/' 



in 



ff 



->■ 



M 2 



m„ 



-)• 



^5 



M 2 



m%->0. 



(5.12) 



and m 2 3 -. 



all coincide with the maximal 



With this choice, one gets that m 2 ^, mjL 

possible value of m 2 j. This value certainly represents the maximal possible value 

for mjL taken on its own. But then it must necessarily represent also the maximal 



possible value for mjL and m 2 ^, because by construction one has m 2 ^ > mjL > rajL 
for any choice of /* and x l . This proves that the above choice for f l and x l is indeed 
the optimal one, and the bound (4.14) thus reads in this case 



m 



9n 



q» 



M l 



(5.13) 



Notice finally that the optimal configuration corresponds in this case to the one that 
maximizes the size of the D auxiliary field relative to the F auxiliary fields: 



D 



-)• 



Qn 



(5.14) 
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It is worth emphasizing that in models where the superpotential is adjusted 
to reach the optimal situation described in the previous paragraph, the super- 
symmetry and gauge-symmetry breaking scales are necessarily comparable, so that 
F ~ g~ l M 2 . In such a case the vector multiplet plays an important role in the low- 
energy dynamics and gives a sizable contribution to supersymmetry breaking, with 
D ~ g~ x M 2 ~ F. The average mass of the Goldstino partners can then be as big as 
m 2 ~ gD ~ M 2 . On the other hand, in models where the superpotential is instead 
such that there is a hierarchy between the supersymmetry and gauge-symmetry 
breaking scales, so that F <C g^M 2 , the situation can never be optimal. In such 
a case the vector multiplet has a small impact on the low-energy dynamics and 
gives a small contribution to supersymmetry breaking, with D ~ gF 2 /M 2 <C F. 
The average mass of the Goldstino partners can then still be non-zero and posi- 
tive, but it is necessarily much smaller than the above-mentioned maximal value: 
m 2 ~ gD ~ g 2 F 2 /M 2 <C M 2 . This is what happens for instance in models where 
the gauge symmetry is broken by large expectation values for scalars along an al- 
most flat direction, like for example the one described in [24] . In this kind of models 
the heavy vector multiplet can actually be integrated out in a supersymmetric way, 
and the fact that the most dangerous mode is related just to the Goldstino direction 
is then obvious from the beginning. A concrete example of this sort, with a mass 
spectrum that indeed displays the above features, is discussed in some detail in [25] . 

Summarizing, we see that in the case of a flat scalar manifold and a linear 
isometry, the lightest scalar field is identified with a partner of the Goldstino, and 
its square mass is positive. In this particular case, one would thus have obtained the 
same bound by looking only at the Goldstino partners and maximizing the smallest 
of their masses by making the effect of the gauging as large as possible. This is 
however an accidental feature of these models, which is due to the flatness and 
maximal symmetry of the space, as well as the fact that there is a single generator. 
In next section we will show that in the case of curved scalar manifolds, the situation 
is no-longer so trivial. 

6 Non-linear gauged sigma models 

Let us next consider the more general case of effective theories with a non-trivial 
Kahler potential and a single U(l) gauge symmetry generated by a Killing vector 
of unspecified form: 

K = K(&&) , X i = X i (¥). (6.1) 

This situation is of course much more complex than the simple particular case con- 
sidered in previous section. Yet one may try to follow the same steps as before. 
A major difference is hat since the Killing vector X 1 is not linear and K is not 
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quadratic, X 1 and IP are no longer linearly related through Q l j. One may how- 
ever introduce the new quantity Q l j = iX % Kj / \K m K m ) , which allows to write the 
relation X 1 = —%Q % jKK In the case of renormalizable gauge theories with a phase 
symmetry, Q l j coincides with Q l j and is constant, but in the more general situation 
considered here Q l j differs from Q l j and is not constant. With this notation, and 
calling Q~ ll j the inverse of Q l j in the subspace where it does not vanish, one can 
then write: 

D = gQ;}X i X\ (6.2) 

M 2 = 2g 2 g tJ X i Xi. (6.3) 

In this more complicated case, the auxiliary field D is again related to the mass 
scale M, but in a more involved and implicit way. Indeed, from the above definitions 
one deduces that D = \g~ x Cf^.M 2 . Moreover, the condition (2.15) implies that 
at a stationary point Qfj\F\ 2 = ^g~ 1 M 2 D. Using the above relation for D, and 
assuming that Qjj ^ 0, this further implies that \F\ 2 = \g~ 2 Q~^ b (Qfj)~ 1 M A . From 
these relations, we see that stationary points are possible only if 

Q&Qff > o • (6-4) 

The values of the overall \F\ and of \D\ are again related to M and their ratio takes 
as before a simple form, but now these relations depend not only on Q l j but also 
on the new quantities Q l j, taken respectively along the directions /* and x l : 

\D\= l -g- 1 \Q-^M 2 ) (6.5) 



\g-\lQMQff)- l M 2 . (6.6) 



2 J y (,r.r-~i,ff 

D 



F 



Q-JkQff- (6-7) 



When instead Qfj = 0, eq. (2.15) implies that \D\ = 0, whereas \F\ and M can be 
arbitrary. As before, this is the only situation where M can be adjusted indepen- 
dently of |F|. 

In this case, the relative importance of .D-type and F-type supersymmetry break- 
ing depends on the vacuum point not only through the direction x l but also through 
Q l j and Q l j. Finding an explicit and quantitative bound on their ratio is then more 
difficult. See for instance [26] for some attempts. Nevertheless, from the above re- 
lations one may still infer a simple although somewhat implicit bound that involves 
the maximal eigenvalue Qmax of Q l j and the minimal eigenvalue of Qmin of Q l j, with 
the constraint that these should have the same sign: 
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In general, this bound can however not be saturated, because Q l j and Q l j are 
different matrices that cannot be diagonalized simultaneously, and it is therefore 
not possible to choose the orthogonal directions /' and x 1 in such a way to get 
simultaneously Q f j = Q ma , x and Q xx = Qmin- 

The masses (3.17), (3.18) and (3.19) can now be computed more explicitly. In 
this case there is an additional contribution coming from the curvature. As before, 
the relation (6.6) allows to rewrite the non-supersymmetric pieces in terms of the 
same scale as the supersymmetric piece. One then finds the following expressions: 



m h = 

2 _ 1 
m xx ~ 9 



m /- 



■\g- 2 M 2 R fIff Q^{Qf f T l + Q- x \ Q ff \ M 2 , (6.9) 

i - l - g - 2 M 2 R fIxx Q- 1 x (QffT 1 + Qll Q x - X + QzUQffT'Qh] m 2 ,(6.io) 

- \g- 2 M 2 R fff , Q- x 1 x (Q fI y 1 + Q~ x \ Q fx ] M 2 . (6.11) 



There are again various restrictions on the ingredients appearing in these expres- 
sions. Concerning the contractions of Qij and Qij, the restriction (6.4) implies as 
before useful constraints. In particular, we have Q^Qff > and Q x \{Qfj)~ 1 Q 2 f f > 
Qx\ Qff — 0- Moreover, Q x \ Q xx is indefinite and deviates from 1 even when x is an 
eigenvector of Q l j , whereas Q xx Q f x has indefinite sign but becomes as before equal 
to whenever either /* or x % is an eigenvector of Q % j. Concerning the contractions 
of Ri0, on the other hand, there does not seem to exist any sharp inequality. 

In this class of models, the masses m 2 f, m xx and m 2 , x depend on the vacuum 
point not only through the orientation of the direction x l and the size of M, but also 
through the values of R^j, Qij and Qij, which are in general not constant. Moreover, 
it is no longer granted that by varying the vacuum point at fixed M one may 
achieve all the possible orientations for x l . The optimization of the superpotential 
defining the bound (4.14) is then a complicated task, and does not simply amount 
to optimizing the orientation of the directions /* and x l . Moreover, even ignoring 
this difficulty, finding the optimal choice is more involved also because of the fact 
that generically it emerges from a competition between the terms that depend only 
on Q^ and Q^ and those that depend also on Rij k f, although there may be regimes 
where one or the other of these two contributions dominates. As a consequence of 
this, we were not able to find any general result for this type of models based on 
curved geometries. We however studied in some detail a few particular examples 
in appendix A, based on simple geometries with covariantly constant curvature and 
simple isometries. The only few remarks that can be made in general concern the 
behavior of the various contractions that appear in the masses m 2 f, m xx and ml x 
when the directions f l and x l are varied. To get an idea of what may happen, 
we may treat f l and x l as arbitrary directions and enforce the constraints that 
9ijf l P = I; 9ij% l & — 1 and gijf l x^ = through Lagrange multipliers. Proceeding 
in this way, one then finds the following results. When Q^j is extremal Qf x = 0, 
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when Q x \ is extremal Q f \ = 0, when QffQ^ is extremal QfsQJx + QffQ/x = 0> 
and finally when -R//// is extremal Rjffs — 0. 

Summarizing, we see that in the case of a curved scalar manifold and a generic 
isometry, the lightest scalar field is generically identified with a linear combination 
of Goldstino and Goldstone partners, and its square mass is not necessarily positive. 
In this case, one would thus have obtained a too optimistic bound by looking only 
at the Goldstino partners and maximizing the smallest of their mass. Notice finally 
that the optimal situation does not necessarily correspond to the one that maximizes 
the effect of the gauging. 

7 Conclusions 

In this work, we have shown that it is possible to derive an absolute upper bound 
on the mass of the lightest scalar field of a theory with spontaneously broken su- 
persymmetry and local gauge symmetries. This can be obtained by focusing on the 
subset of scalar fields corresponding to the partners of the Goldstino fermion and 
the gauge vector bosons, for which the mass is constrained by symmetry arguments. 
The resulting bound has the property that it can be saturated by adjusting the 
superpotential. Requiring it to be positive is therefore a necessary and sufficient 
condition on the remaining functions specifying the kinetic terms for the existence 
of a metastable supersymmetry-breaking vacuum. We have shown that by including 
also the Goldstone partners one finds in general a stronger bound than by consid- 
ering just the Goldstino partners, and we have illustrated this fact through several 
explicit examples. 

Our result has interesting implications on the conditions for the existence of 
metastable supersymmetry-breaking vacua in generic supersymmetric theories with 
local gauge symmetries. Indeed, the region of parameter space where tachyons can 
be avoided is reduced when one considers not only the Goldstino partners but also 
the Goldstone partners, since there are points where the former have positive square 
mass while the latter or linear combinations of the two have negative square mass. 
We believe that there may in fact exist models where the upper bound derived from 
just the Goldstino partners is positive whereas the upper bound derived by including 
also the Goldstone partners is negative. In such a situation, one would then find an 
obstruction against the existence of metastable supersymmetry-breaking vacua that 
comes from the Goldstone partners rather than from the Goldstino partners. In 
the light of this possibility, it would be interesting to apply the result that we have 
derived to reexamine the conditions for the existence of metastable supersymmetry- 
breaking vacua in theories where the gauging plays a crucial role. One class of 
models where this could perhaps uncover new instabilities is that of theories with 
extended supersymmetry, and more specifically those where the Goldstino partners 
do not seem to lead necessarily to tachyons. This is for instance the case of N = 2 
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theories with non-Abelian vector multiplets and/or charged hyper multiplets. 

To conclude, we would like to comment on the generalization of our result to the 
case of supergravity theories. The only technical difficulty to extend our analysis to 
that case is the fact that the Goldstino direction /' and the Goldstone directions x l a 
are no longer orthogonal, as a consequence of the additional gravitational term in 
the definition of the auxiliary fields. More precisely, one gets g^f % xP a = ig~ 1 m 3 / 2 D a . 
As a consequence, the set of vectors f l and x l a can no longer be chosen to form an 
orthonormal set, although it still represents a complete set of dangerous directions. 
The restriction of the mass matrix to this subspace is then no longer given just by 
eq. (3.14) but by a more complex expression. As a result, the analysis becomes 
technically more complicated. But for the rest one can apply the same strategy we 
developed in this paper for theories with rigid supersymmetry. 
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A Explicit examples 

In this appendix, we study in some detail a few concrete examples to illustrate our 
general results. We focus on models with two fields and one gauge symmetry. In 
this situation, the Goldstino and Goldstone directions f l and x l are rigidly tied and 
can be parametrized with a single angle 6, which we shall define in such a way that 
the mass M is constant. Another simplification that occurs in the two-field case is 
that one simply has Q 2 f j = {Qff) 2 + |Q/s| 2 - We shall take 6 G [0, 27r], but in all 
the examples below the behaviors of the masses in the four quadrants are related 
by simple reflections. 

As a first simple example, let us discuss the case of quadratic Kahler potential 
and linear Killing vector, which corresponds to a flat scalar manifold with a phase 
isometry defined by positive charges: 

K = $ 1 <l 1 + $ 2 <5 2 , X i = -i (gi^ 1 , g 2 $ 2 ) . (A.l) 

In this case, we can parametrize the vacuum in the following way: 

$* = i f X M (q- 1 cos 9, q 2 l sin 6) . (A.2) 

V2 

The Goldstone and Goldstino directions are then given by x % = — i ( cos#, sin#) and 
/* = —i (sin#, — cos#), and the metric is clearly trivial: g^ = Sij. The relations 
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between \D\, \F\ and M 2 are in this case: 

| D | = \-g- x (a? cos 2 9 + qf sin 2 9 ) M 2 , (A. 3) 

I^H^^te^r^M 2 . (A.4) 

We then get: 

D 
~F 



'Sico^e + J^si^e. (A.5) 

Qi V <?2 



In this case -R^; - vanishes identically and we therefore get: 

R ffff = ° ' R ihx = , i?/// s = . (A.6) 

The matrix elements of Qij are instead given simply by: 



Qff = Q2 cos 2 9 + q t sin 2 9, (A.7) 



Qxx = <?i cos 2 9 + q 2 sin 2 , (A. 8) 

Qfx = (qi -g 2 )cos0sin0. (A.9) 

The elements m 2 j, m 2 s , m 2 ^ and the eigenvalues m^ of the mass matrix are equal 
to M 2 times some functions of 9 and (71/(72 • The behavior of m 2 JM 2 and m 2 _/M 2 
as functions of is shown in fig. 1 for some particular choice of (71/(72 ■ More in 
general, one finds the following behavior. If g x > q 2 , m 2 , and m 2 _ both reach their 
maxima for 9 = |, and at that point m 2 f JM 2 = (71/(72, m 1x/M 2 = | (2 + (71/(72) and 
m 2 s /M 2 = 0, so that rri^/M 2 = (71/(72- The optimal direction is therefore = |, 
and the bound is m 2 /M 2 = (71/(72- If instead (72 > <7i; the situation is similar but 
with q 1 <H- q 2 and -B- f - 0. 

As a second simple example, let us discus the case of logarithmic Kahler potential 
and constant Killing vector, which corresponds to a constantly and positively curved 
scalar manifold with a shift isometry defined by positive shifts: 

K = -A 2 log (— ^J " A 2 , log [-^^) . ^ = < K ^2) ■ (A-10) 

The two scales A x and A 2 define the curvatures of the two field sectors, whereas the 
two scales Ai and A 2 define the gauge shifts. It is then convenient to introduce the 
following dimensionless parameters: 

#Ai gA 2 gAi gA 2 

^1 — ITT- > ^2 = —TT- > °1 — ^~^ > a 2 = "^Tr" • (A. 11) 

In this case, we can parametrize the vacuum in the following way, by including 
absolute values to take into account that the fields are in this case restricted to have 
a positive real part: 

¥ = ^j=g- 1 M(a 1 X 1 \aece\, a 2 A 2 |csc0|). (A.12) 
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Figure 1: Plot of vn,JM (upper curve) and m_/M (lower curve) as 
functions of for the model with quadratic Kahler potential and linear 
Killing vectors defined by (A.l), with (71/92 = 3. 



The Goldstone and Goldstino directions are then given by x % = \J~2i (a\, a?) and 
f % = \f% (ai| tan#|, — a 2 \ cot 8\), whereas g^ = \ diag(a7 2 cos 2 8, a^sin 2 8). The 
relation between \D\, \F\ and M 2 are in this case: 



\D\ = ^^ 1 (Ai|cosfl| + A 2 |sinfl|)M 2 , 

v2 

|F| = ^^v/A^^cos^sin^l^^M 2 
V2 



(A.13) 
(A.14) 



We then get: 

D 
F 



■>/ 1 2 cos 8 sin 8 1 I W — | cos#| + -y / — — | sin#| 



The contractions of R ilk i are given by 



R 



ffff 



2g 2 M- 2 (X^ 2 cos 4 8 + X^sm 4 8) 



R ffsS = 2g 2 M- 2 (A 2 2 + A^ 2 ) cos 2 8 sin 2 8 , 

R fff . = 2g 2 M- 2 (A^ 2 sin 2 8 - X 2 2 cos 2 8) | cos 8 sin 8\ 



(A.15) 



(A.16) 
(A. 17) 
(A.18) 



The matrix elements of Q^ are instead found to be independent of the shifts a^ and 
dominated by the effect of the connection term in their definition, as a result of the 
fact that the Killing vectors are constant: 



Q f f= V2{\f\ cos 8\ + A1 1 1 sin 6»|) I cos 6» sin 6»| , 

Q xS = V2 (Al 1 1 cos 3 8\ + A 2 X | sin 3 8\) , 

Qfx = \/2 (A^ 1 ! cos 8 1 — A2 1 1 sin 8\) | cos 8 sin 8\ . 



(A.19) 
(A.20) 
(A.21) 
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The elements m 2 f r, m^, m^ 



! and the eigenvalues rn± of the mass matrix are equal 



to M 2 times some functions of 9 and A1/A2. The behavior of m 2 JM 2 and m 2 _/M 2 
as functions of 9 is shown in fig. 2 for some particular choice of A1/A2. More in 



general, one finds the following behavior. m 2 j reaches its maximum for 9 



f and 



at that point m 2 ff -/M 2 = 1 + ~(Ai/A 2 + A 2 /Ai), m 2 xS /M 2 = 1 + §(Ai/A 2 + A2/A1) and 
mj s /M 2 = — |(Ai/A 2 — A 2 /Ai), so that rn^/M 2 is smaller-or-equal than m 2 JM 2 . 
The maximum of m^/M 2 occurs instead for some 9 < j if Ai > A 2 and for some 
9 > j if Ai < A 2 , and takes a value that is smaller than 1 + |(Ai/A 2 + A 2 /Ai). For 
Ai ~ A 2 , the optimal direction is 9 ~ j and the bound is m 2 /M 2 ~ |, which is 
identical to the one that one would have obtained by looking just at the Goldstino 
direction. For Ai ^> A 2 , on the other hand, a numerical study shows that the optimal 
direction is 9 ~ 0.67 and the bound is m 2 /M 2 ~ 0.13Ai/A 2 , which is a factor 1.86 
smaller than the one that one would have inferred by looking just at the Goldstino 
direction, although still positive. For Ai <C A 2 , the situation is similar but with 
Ai <H- A 2 and 9 <B- f — 9. 



-1 - 




Figure 2: Plot of rnir/M 2 (upper curve) and m?_/M 2 (lower curve) 
as functions of 9 for the model with logarithmic Kahler potential and 
constant Killing vectors defined by (A. 10), with A1/A2 = t. 



As a third slightly more complicated and richer example, let us finally discus the 
case of logarithmic Kahler potential and linear Killing vector, which corresponds to 
a constantly and positively curved scalar manifold with a phase isometry defined by 
positive charges: 



K 



-A? tog (l-- S |-J - A^log fl-^rj , X i = ~i M\<?2$ 2 ) .(A.22) 



The two scales Ai and A 2 define as before the curvatures of the two field sectors. 
It turns out that by varying the overall scale of these curvatures with respect to 
the vector mass scale, this new model interpolates between the two previous ones. 
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This can be seen as follows. The small curvature limit corresponds to take A$ large 
and $* finite, so that $ l /Aj is close to 0. In this limit one can keep the same 
coordinates and just expand the logarithm in K. In this way one then recovers the 
model (A.l). The large curvature limit corresponds instead to take A, small and 
$* also small, so that $ l /Aj is close to 1. In this limit, it is convenient to change 
coordinates to describe the model in a more transparent way. The appropriate 
reparametrization turns out to be $*/Aj — > (1 — |$'/Aj)/(1 + |$*/Aj). Discarding 
an irrelevant Kahler transformation, one then finds K — > — J^ A 2 log(($ J + $*)/Aj) 
and X % — > ?gjAj(l — i$* 2 /A 2 ). In these new coordinates, $'/Aj is close to 0. 
In this limit one then manifestly recovers the model (A. 10) with the same field 
parametrization and shifts given by Ai = q{A{. To parametrize the effects of the 
curvatures, we introduce as before the dimensionless parameters 



Ai 



<?Ai 



A 5 



gM 



M ' M 

It will also be useful to introduce the short-hand notation 



u(9) = H 



cos 9 
<7iAi 



v{9) = H 



/sin0\ 

\g2A 2 / 



(A.23) 



(A.24) 



where H(x) is the following monotonically decreasing function: 



H(x) 



Vl + 2^-1 f 1 , \x\ < 1 
x2 ~ \ \ftl\x\ , |x| > 1 



(A.25) 



In this case, we can parametrize the vacuum in the following way: 

¥ = -^g^M (qfu{9) cos9, q 2 1 v(9) sin9) . 
V2 



(A.26) 



The Goldstone and Goldstino directions then read x % = —i (u(9)cos9, v(9)s'm9) 
and /* = —i (u(6) sin9, —v(9) cos9), and the metric is g^ = diag(l/-u 2 (6>), l/t> 2 (#)). 
The relation between \D\, \F\ and M 2 are in this case: 



D\ = V 1 [q 1 1 u{9) cos 2 9 + q 2 1 v(9) sin 2 9)M 2 



_ . , , qj 1 u(9) cos 2 9 + q 2 1 v(9) sin 2 9 

' ' "" 2 9 \l q 2 [2/v(6) - 1] cos 2 9 + qi [2/u(9) - l] sin 2 9 



M 2 . 



(A.27) 
(A.28) 



We then get: 



u(9) cos 2 9 + J— v(9) sin 2 9 
Qi V 12 




x , / , / — [2/v(9) - 1] cos 2 e + J— [2/u(9) - 1] sin 2 

<?1 ' V <?2 



(A.29) 
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The contractions of R^ki are given by 



R 

R 



ffff 



2g 2 M' 2 (A^ 2 + A^ 2 ) cos 2 9 sin 2 9 , 



R fff . = 2g 2 M- 2 (Xf sin 2 9 - X^ 2 cos 2 9) cos 9 sin 9 . 

The matrix elements of Qij are instead found to be: 

Q ff = q 2 [2/v(9) - 1] cos 2 9 + qi [2/u{9) - l] sin 2 9 , 
Q xS = Ql [2/u{9) - 1] cos 2 9 + q 2 [2/v{9) - l] sin 2 9 , 
Q /s = (gi [2/u{9) - 1] - q 2 [2/v{9) - l] ) cos 9 sin . 



(A.30) 
(A.31) 
(A.32) 



(A.33) 
(A.34) 
(A.35) 



The elements m 2 *, m 2 s , m 2 s 
M 2 



and the eigenvalues m\ of the mass matrix are equal to 

2 //\/f2 



times some functions of 9, A1/A2, ?i/?2 and gig2AiA2- The behavior of m.f/M 
and m 2 _/M 2 as functions of # is shown in fig. 3 for some particular choice of A1/A2, 
qi/q 2 and q\q 2 XiX 2 . More in general, one finds the following behavior. m 2 f and m 2 _ 
reach maxima for two different values of 9, and the maximal value of m 2 _ is always 
smaller than the maximal value of m 2 f t. This shows once again that the bound that 
one would have inferred by looking only at the Goldstino direction is weaker than 
the bound m 2 that one obtains by taking into account also the Goldstone direction. 
One moreover verifies that in the limit Aj ^> 1 one recovers the behavior of the 
model with quadratic K and linear X 1 with charges q^, whereas in the limit Aj ^C 1 
one reaches the behavior of the model with logarithmic K and constant X 1 with 
shifts Ai = gjAj. 




Figure 3: Plot of m 2 f ir/M 2 (upper curve) and m 2 _/M 2 (lower curve) 
as functions of 9 for the model with logarithmic Kahler potential and 
linear Killing vectors defined by (A. 22), with A1/A2 = ^, gi/<?2 = 3 and 
9ig , 2AiA 2 = 1. 
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